The Microarchitecture of Superscalar Processors - Proceedings of the IEEE
Abstract
Superscalar processing is the latest in a long series of innovations aimed at producing ever-faster microprocessors. By exploiting instruction-level parallelism, superscalar processors are capable of executing more than one instruction in a clock cycle. This paper discusses the microarchitecture of superscalar processors. We begin with a discussion of the general problem solved by superscalar processors: converting an ostensibly sequential program into a more parallel one. The principles underlying this process, and the constraints that must be met, are discussed. The paper then provides a description of the specific implementation techniques used in the important phases of superscalar processing. The major phases include: 1) instruction fetching and conditional branch processing, 2) the determination of data dependences involving register values, 3) the initiation, or issuing, of instructions for parallel execution, 4) the communication of data values through memory via loads and stores, and 5) committing the process state in correct order so that precise interrupts can be supported. Examples of recent superscalar microprocessors, the MIPS R10000, the DEC 21164, and the AMD K5, are used to illustrate a variety of superscalar methods.
Similar resources
Clear and Present Tensions in Microprocessor Design
Microprocessor performance can be improved by increasing either the clock frequency (GHz) or the instruction level parallelism (IPC) or both. There is a complex and adversarial relationship between increasing GHz and increasing IPC. This talk revisits the old “speed-demons vs. brainiacs” debate (in some sense the reincarnation of the “RISC vs. CISC” debate) and recasts it as the tension between...
Application of instruction analysis/scheduling techniques to resource allocation of superscalar processors
This paper presents the development of instruction analysis/scheduling CAD techniques to measure the distribution of functional unit usage and the micro operation level parallelism (MLP), which together determine the proper functional unit allocation for superscalar microprocessors, such as the x86 microprocessors. The proposed techniques fit in the early design exploration phase in which the t...
The Design of the Microarchitecture of UltraSPARC-I - Proceedings of the IEEE
The realization of a high-performance modern microprocessor involves hundreds of person-years of conception, logic design, circuit design, layout drawing, etc. In order to leverage effectively the 5-10 millions of transistors available, careful microarchitecture tradeoff analysis must be performed. This paper describes not only the microarchitecture of UltraSPARC-I, a 167 MHz 64-b four-way supersca...
A Trace Cache Microarchitecture and Evaluation
As the instruction issue width of superscalar processors increases, instruction fetch bandwidth requirements will also increase. It will eventually become necessary to fetch multiple basic blocks per clock cycle. Conventional instruction caches hinder this effort because long instruction sequences are not always in contiguous cache locations. Trace caches overcome this limitation by caching tra...